50 research outputs found

    From Stop to Start: Tandem Gene Arrangement, Copy Number and Trans-Splicing Sites in the Dinoflagellate Amphidinium carterae

    Dinoflagellate genomes present unique challenges including large size, modified DNA bases, lack of nucleosomes, and condensed chromosomes. EST sequencing has shown that many genes are found as many slightly different variants, implying that many copies are present in the genome. As a preliminary survey of the genome, our goal was to obtain genomic sequences for 47 genes from the dinoflagellate Amphidinium carterae. A PCR approach was used to avoid problems with large insert libraries. One primer set was oriented inward to amplify the genomic complement of the cDNA, and a second primer set was oriented outward to amplify between tandem repeats of the same gene. Each gene was also tested for a spliced leader using cDNA as template. Almost all (14/15) of the highly expressed genes (i.e. those with high representation in the cDNA pool) were shown to be in tandem arrays with short intergenic spacers, and most were trans-spliced. Only two moderately expressed genes were found in tandem arrays. A polyadenylation signal was found in genomic copies containing the sequence AAAAG/C at the exact polyadenylation site and was conserved between species. Four genes were found to have a high intron density (>5 introns), while most either lacked introns or had only one to three. Actin was selected for deeper sequencing of both genomic and cDNA copies. Two clusters of actin copies were found, separated from each other by many non-coding features such as intron size and sequence. One intron-rich gene was selected for genomic walking using inverse PCR, and was not shown to be in a tandem repeat. This first glimpse of the dinoflagellate genome indicates two general categories of genes in dinoflagellates: a highly expressed tandem repeat class and an intron-rich, less expressed class. This combination of features appears to be unique among eukaryotes.

    Comparative Genomic and Transcriptomic Characterization of the Toxigenic Marine Dinoflagellate Alexandrium ostenfeldii

    Many dinoflagellate species are notorious for the toxins they produce and the ecological and human health consequences associated with harmful algal blooms (HABs). Dinoflagellates are particularly refractory to genomic analysis due to the enormous genome size, lack of knowledge about their DNA composition and structure, and peculiarities of gene regulation, such as spliced leader (SL) trans-splicing and mRNA transposition mechanisms. Alexandrium ostenfeldii is known to produce macrocyclic imine toxins, described as spirolides. We characterized the genome of A. ostenfeldii using a combination of transcriptomic data and random genomic clones for comparison with other dinoflagellates, particularly Alexandrium species. Examination of SL sequences revealed similar features as in other dinoflagellates, including Alexandrium species. SL sequences in decay indicate frequent retro-transposition of mRNA species. This probably contributes to overall genome complexity by generating additional gene copies. Sequencing of several thousand fosmid and bacterial artificial chromosome (BAC) ends yielded a wealth of simple repeats and tandemly repeated longer sequence stretches, which we estimated to comprise more than half of the whole genome. Surprisingly, the repeats comprise a very limited set of 79–97 bp sequences; in part the genome is thus a relatively uniform sequence space interrupted by coding sequences. Our genomic sequence survey (GSS) represents the largest genomic data set of a dinoflagellate to date. Alexandrium ostenfeldii is a typical dinoflagellate with respect to its transcriptome and mRNA transposition but demonstrates Alexandrium-like stop codon usage. The large portion of repetitive sequences and the organization within the genome is in agreement with several other studies on dinoflagellates using different approaches. It remains to be determined whether this unusual composition is directly correlated to the exceptional genome organization of dinoflagellates with a low amount of histones and histone-like proteins.

    Distinct Gene Number-Genome Size Relationships for Eukaryotes and Non-Eukaryotes: Gene Content Estimation for Dinoflagellate Genomes

    The ability to predict gene content is highly desirable for characterization of not-yet sequenced genomes like those of dinoflagellates. Using data from completely sequenced and annotated genomes from phylogenetically diverse lineages, we investigated the relationship between gene content and genome size using regression analyses. Distinct relationships between log10-transformed protein-coding gene number (Y′) versus log10-transformed genome size (X′, genome size in kbp) were found for eukaryotes and non-eukaryotes. Eukaryotes best fit a logarithmic model, Y′ = ln(-46.200+22.678X′), whereas non-eukaryotes fit a linear model, Y′ = 0.045+0.977X′, both with high significance (p<0.001, R²>0.91). Total gene number shows similar trends in both groups to their respective protein coding regressions. The distinct correlations reflect lower and decreasing gene-coding percentages as genome size increases in eukaryotes (82%–1%) compared to higher and relatively stable percentages in prokaryotes and viruses (97%–47%). The eukaryotic regression models project that the smallest dinoflagellate genome (3×10⁶ kbp) contains 38,188 protein-coding (40,086 total) genes and the largest (245×10⁶ kbp) 87,688 protein-coding (92,013 total) genes, corresponding to 1.8% and 0.05% gene-coding percentages. These estimates do not likely represent extraordinarily high functional diversity of the encoded proteome but rather highly redundant genomes, as evidenced by high gene copy numbers documented for various dinoflagellate species.

    Identifying and Characterizing Alternative Molecular Markers for the Symbiotic and Free-Living Dinoflagellate Genus Symbiodinium

    Dinoflagellates in the genus Symbiodinium are best known as endosymbionts of corals and other invertebrate as well as protist hosts, but also exist free-living in coastal environments. Despite their importance in marine ecosystems, fewer than 10 loci have been used to explore phylogenetic relationships in this group, and only the multi-copy nuclear ribosomal Internal Transcribed Spacer (ITS) regions 1 and 2 have been used to characterize fine-scale genetic diversity within the nine clades (A–I) that comprise the genus. Here, we describe a three-step molecular approach focused on 1) identifying new candidate genes for phylogenetic analysis of Symbiodinium spp., 2) characterizing the phylogenetic relationship of these candidate genes from DNA samples spanning eight Symbiodinium clades (A–H), and 3) conducting in-depth phylogenetic analyses of candidate genes displaying genetic divergences equal to or higher than those within the ITS-2 of Symbiodinium clade C. To this end, we used bioinformatics tools and reciprocal comparisons to identify homologous genes from 55,551 cDNA sequences representing two Symbiodinium and six additional dinoflagellate EST libraries. Of the 84 candidate genes identified, 7 Symbiodinium genes (elf2, coI, coIII, cob, calmodulin, rad24, and actin) were characterized by sequencing 23 DNA samples spanning eight Symbiodinium clades (A–H). Four genes displaying higher rates of genetic divergence than ITS-2 within clade C were selected for in-depth phylogenetic analyses, which revealed that calmodulin has limited taxonomic utility but that coI, rad24, and actin behave predictably with respect to Symbiodinium lineage C and are potential candidates as new markers for this group. The approach for targeting candidate genes described here can serve as a model for future studies aimed at identifying and testing new phylogenetically informative genes for taxa where transcriptomic and genomic data are available.

    Discovery of Nuclear-Encoded Genes for the Neurotoxin Saxitoxin in Dinoflagellates

    Saxitoxin is a potent neurotoxin that occurs in aquatic environments worldwide. Ingestion of vector species can lead to paralytic shellfish poisoning, a severe human illness that may lead to paralysis and death. In freshwaters, the toxin is produced by prokaryotic cyanobacteria; in marine waters, it is associated with eukaryotic dinoflagellates. However, several studies suggest that saxitoxin is not produced by dinoflagellates themselves, but by co-cultured bacteria. Here, we show that genes required for saxitoxin synthesis are encoded in the nuclear genomes of dinoflagellates. We sequenced >1.2×10⁶ mRNA transcripts from the two saxitoxin-producing dinoflagellate strains Alexandrium fundyense CCMP1719 and A. minutum CCMP113 using high-throughput sequencing technology. In addition, we used in silico transcriptome analyses, RACE, qPCR and conventional PCR coupled with Sanger sequencing. These approaches successfully identified genes required for saxitoxin synthesis in the two transcriptomes. We focused on sxtA, the unique starting gene of saxitoxin synthesis, and show that the dinoflagellate transcripts of sxtA have the same domain structure as the cyanobacterial sxtA genes. But, in contrast to the bacterial homologs, the dinoflagellate transcripts are monocistronic, have a higher GC content, occur in multiple copies, and contain typical dinoflagellate spliced-leader sequences and eukaryotic polyA-tails. Further, we investigated 28 saxitoxin-producing and non-producing dinoflagellate strains from six different genera for the presence of genomic sxtA homologs. Our results show very good agreement between the presence of sxtA and saxitoxin synthesis, except in three strains of A. tamarense, for which we amplified sxtA but did not detect the toxin. Our work opens up possibilities for developing molecular tools to detect saxitoxin-producing dinoflagellates in the environment.

    Genome Fragmentation Is Not Confined to the Peridinin Plastid in Dinoflagellates

    When plastids are transferred between eukaryote lineages through a series of endosymbioses, their environment changes dramatically. Comparison of dinoflagellate plastids that originated from different algal groups has revealed convergent evolution, suggesting that the host environment mainly influences the evolution of the newly acquired organelle. Recently the genome from the anomalously pigmented dinoflagellate Karlodinium veneficum plastid was uncovered as a conventional chromosome. To determine if this haptophyte-derived plastid contains additional chromosomal fragments that resemble the minicircles of the peridinin-containing plastids, we have investigated its genome by in-depth sequencing using 454 pyrosequencing technology, PCR and clone library analysis. Sequence analyses show several genes with significantly higher copy numbers than present in the chromosome. These genes are most likely extrachromosomal fragments, and the ones with highest copy numbers include genes encoding the chaperone DnaK (Hsp70), the rubisco large subunit (rbcL), and two tRNAs (trnE and trnM). In addition, some photosystem genes such as psaB, psaA, psbB and psbD are overrepresented. Most of the dnaK and rbcL sequences are found as shortened or fragmented gene sequences, typically missing the 3′-terminal portion. Both dnaK and rbcL are associated with a common sequence element consisting of about 120 bp of highly conserved AT-rich sequence followed by a trnE gene, possibly serving as a control region. Decatenation assays and Southern blot analysis indicate that the extrachromosomal plastid sequences do not have the same organization or lengths as the minicircles of the peridinin dinoflagellates. The fragmentation of the haptophyte-derived plastid genome of K. veneficum is likely a sign of a host-driven process shaping the plastid genomes of dinoflagellates.

    Phylogenomics Reshuffles the Eukaryotic Supergroups

    Background. Resolving the phylogenetic relationships between eukaryotes is an ongoing challenge of evolutionary biology. In recent years, the accumulation of molecular data led to a new evolutionary understanding, in which all eukaryotic diversity has been classified into five or six supergroups. Yet, the composition of these large assemblages and their relationships remain controversial. Methodology/Principal Findings. Here, we report the sequencing of expressed sequence tags (ESTs) for two species belonging to the supergroup Rhizaria and present the analysis of a unique dataset combining 29908 amino acid positions and an extensive taxon sampling made of 49 mainly unicellular species representative of all supergroups. Our results show a very robust relationship between Rhizaria and two main clades of the supergroup chromalveolates: stramenopiles and alveolates. We confirm the existence of consistent affinities between assemblages that were thought to belong to different supergroups of eukaryotes, thus not sharing a close evolutionary history. Conclusions. This well supported phylogeny has important consequences for our understanding of the evolutionary history of eukaryotes. In particular, it questions a single red algal origin of the chlorophyll-c containing plastids among the chromalveolates. We propose the abbreviated name ‘SAR’ (Stramenopiles+Alveolates+Rhizaria) to accommodate this new super assemblage of eukaryotes, which comprises the largest diversity of unicellular eukaryotes.

    Improved Resolution of Reef-Coral Endosymbiont (Symbiodinium) Species Diversity, Ecology, and Evolution through psbA Non-Coding Region Genotyping

    Ribosomal DNA sequence data abound from numerous studies on the dinoflagellate endosymbionts of corals, and yet the multi-copy nature and intragenomic variability of rRNA genes and spacers confound interpretations of symbiont diversity and ecology. Making consistent sense of extensive sequence variation in a meaningful ecological and evolutionary context would benefit from the application of additional genetic markers. Sequences of the non-coding region of the plastid psbA minicircle (psbAncr) were used to independently examine symbiont genotypic and species diversity found within and between colonies of Hawaiian reef corals in the genus Montipora. A single psbAncr haplotype was recovered in most samples through direct sequencing (∼80–90%), and members of the same internal transcribed spacer region 2 (ITS2) type were phylogenetically differentiated from other ITS2 types by substantial psbAncr sequence divergence. The repeated sequencing of bacterially-cloned fragments of psbAncr from samples and clonal cultures often recovered a single numerically common haplotype accompanied by rare, highly similar sequence variants. When sequence artifacts of cloning and intragenomic variation are factored out, these data indicate that most colonies harbored one dominant Symbiodinium genotype. The cloning and sequencing of ITS2 DNA amplified from these same samples recovered numerically abundant variants (that are diagnostic of distinct Symbiodinium lineages), but also generated a large number of sequences comprising PCR/cloning artifacts combined with ancestral and/or rare variants that, if incorporated into phylogenetic reconstructions, confound how small sequence differences are interpreted. Finally, psbAncr sequence data from a broad sampling of Symbiodinium diversity obtained from various corals throughout the Indo-Pacific were concordant with ITS lineage membership (defined by denaturing gradient gel electrophoresis screening), yet exhibited substantially greater sequence divergence and revealed strong phylogeographic structure corresponding to major biogeographic provinces. The detailed genetic resolution provided by psbAncr data brings further clarity to the ecology, evolution, and systematics of symbiotic dinoflagellates.

    Evolutionary distinctiveness of fatty acid and polyketide synthesis in eukaryotes

    © 2016 International Society for Microbial Ecology. All rights reserved. Fatty acids, which are essential cell membrane constituents and fuel storage molecules, are thought to share a common evolutionary origin with polyketide toxins in eukaryotes. While fatty acids are primary metabolic products, polyketide toxins are secondary metabolites that are involved in ecologically relevant processes, such as chemical defence, and produce the adverse effects of harmful algal blooms. Selection pressures on such compounds may be different, resulting in differing evolutionary histories. Surprisingly, some studies of dinoflagellates have suggested that the same enzymes may catalyse these processes. Here we show the presence and evolutionary distinctiveness of genes encoding six key enzymes essential for fatty acid production in 13 eukaryotic lineages for which no previous sequence data were available (alveolates: dinoflagellates, Vitrella, Chromera; stramenopiles: bolidophytes, chrysophytes, pelagophytes, raphidophytes, dictyochophytes, pinguiophytes, xanthophytes; Rhizaria: chlorarachniophytes, haplosporida; euglenids) and 8 other lineages (apicomplexans, bacillariophytes, synurophytes, cryptophytes, haptophytes, chlorophyceans, prasinophytes, trebouxiophytes). The phylogeny of fatty acid synthase genes reflects the evolutionary history of the organism, indicating selection to maintain conserved functionality. In contrast, polyketide synthase gene families are highly expanded in dinoflagellates and haptophytes, suggesting relaxed constraints in their evolutionary history, while being completely absent from some protist lineages. This demonstrates a vast potential for the production of bioactive polyketide compounds in some lineages of microbial eukaryotes, indicating that the evolution of these compounds may have played an important role in their ecological success.

    Evolution of light-harvesting complex proteins from Chl c-containing algae

    Background. Light harvesting complex (LHC) proteins function in photosynthesis by binding chlorophyll (Chl) and carotenoid molecules that absorb light and transfer the energy to the reaction center Chl of the photosystem. Most research has focused on LHCs of plants and chlorophytes that bind Chl a and b, and extensive work on these proteins has uncovered a diversity of biochemical functions, expression patterns and amino acid sequences. We focus here on a less-studied family of LHCs that typically bind Chl a and c, and that are widely distributed in Chl c-containing and other algae. Previous phylogenetic analyses of these proteins suggested that individual algal lineages possess proteins from one or two subfamilies, and that most subfamilies are characteristic of a particular algal lineage, but genome-scale datasets had revealed that some species have multiple different forms of the gene. Such observations also suggested that there might have been an important influence of endosymbiosis in the evolution of LHCs. Results. We reconstruct a phylogeny of LHCs from Chl c-containing algae and related lineages using data from recent sequencing projects to give ~10-fold larger taxon sampling than previous studies. The phylogeny indicates that individual taxa possess proteins from multiple LHC subfamilies and that several LHC subfamilies are found in distantly related algal lineages. This phylogenetic pattern implies functional differentiation of the gene families, a hypothesis that is consistent with data on gene expression, carotenoid binding and physical associations with other LHCs. In all probability LHCs have undergone a complex history of evolution of function, gene transfer, and lineage-specific diversification. Conclusion. The analysis provides a strikingly different picture of LHC diversity than previous analyses of LHC evolution. Individual algal lineages possess proteins from multiple LHC subfamilies. Evolutionary relationships showed support for the hypothesized origin of Chl c plastids. This work also allows recent experimental findings about molecular function to be understood in a broader phylogenetic context.